Flexible clustering via hidden hierarchical Dirichlet priors

نویسندگان

چکیده

The Bayesian approach to inference stands out for naturally allowing borrowing information across heterogeneous populations, with different samples possibly sharing the same distribution. A popular nonparametric model clustering probability distributions is nested Dirichlet process, which however has drawback of grouping in a single cluster when ties are observed samples. With goal achieving flexible and effective method both observations, we investigate prior that arises as composition two discrete random structures derive closed-form expression induced distribution partition, fundamental tool regulating behavior model. On one hand, this allows gain deeper insight into theoretical properties and, on other it yields an MCMC algorithm evaluating inferences interest. Moreover, limitations working more than populations consequently, devise alternative efficient sampling scheme, by-product, testing homogeneity between populations. Finally, perform comparison process provide illustrative examples synthetic real data.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Infinite Hidden Markov Models via the Hierarchical Dirichlet Process

Category: graphical models. In this presentation, we propose a new formalism under which we study the infinite hidden Markov model (iHMM) of Beal et al. [2]. The iHMM is a hidden Markov model (HMM) in which the number of hidden states is allowed to be countably infinite. This is achieved using the formalism of the Dirichlet process. In particular, a two-level urn model is used to determine the ...

متن کامل

Hierarchical Dirichlet Process Hidden Semi-Markov Models

Given a set of sequential data in an unsupervised setting, we often aim to infer meaningful states present in the data along with characteristics that describe and distinguish those states. For example, in a speaker diarization (or who-spoke-when) problem, we are given a single audio recording of a meeting and wish to infer the number of speakers present, when they speak, and some characteristi...

متن کامل

Flexible Priors for Exemplar-based Clustering

Exemplar-based clustering methods have been shown to produce state-of-the-art results on a number of synthetic and real-world clustering problems. They are appealing because they offer computational benefits over latent-mean models and can handle arbitrary pairwise similarity measures between data points. However, when trying to recover underlying structure in clustering problems, tailored simi...

متن کامل

Spike train entropy-rate estimation using hierarchical Dirichlet process priors

Entropy rate quantifies the amount of disorder in a stochastic process. For spiking neurons, the entropy rate places an upper bound on the rate at which the spike train can convey stimulus information, and a large literature has focused on the problem of estimating entropy rate from spike train data. Here we present Bayes least squares and empirical Bayesian entropy rate estimators for binary s...

متن کامل

Detecting Abnormal Events via Hierarchical Dirichlet Processes

Detecting abnormal event from video sequences is an important problem in computer vision and pattern recognition and a large number of algorithms have been devised to tackle this problem. Previous state-based approaches all suffer from the problem of deciding the appropriate number of states and it is often difficult to do so except using a trial-and-error approach, which may be infeasible in r...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Scandinavian Journal of Statistics

سال: 2022

ISSN: ['0303-6898', '1467-9469']

DOI: https://doi.org/10.1111/sjos.12578